o 
o 

UJ 



CO 



BIOTEC sDLOGY 

SYSTEMS B 
BRANCH ■ 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 0^ /^3S^ 



Source: 

Date Processed by STIC 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datin21help@usDto.20v or phone 703-306-4119 (R. Wax) 
PATENTBV 3.0 e-mail help: patin30helD(%usDto.gov or phone 703-306-41 19 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 






Raw Seqpeng^^ingEW' Summary 

BER: 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA- HEADERS. WHICH WERE INSERTED BY PTO SOFTWARE 



10 



11 



12 



13 



Wrapped Nucleics 



Wrapped Aminos 



Incocrect Line Length 

Misaligned Amino Acid 
Numt>ering 

Non-ASCII 



Variable Length 



Patentin ver. 2.0 "bug" 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



Use of n's'or Xaa's 
(NEW f^Lft)* 



Use of <21 3>0rganism 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 



Patentin ver. 2.0 "bug" 



The 

betv/een 



The numberAext at the end of each line "wrapped" down to the next line. 
This fnay cK:cuni^QUGfile: wasT^ processor afler creating rt. _ . 

Please adjust yoiiFri^rSirgin to . 3" prevent "Vrapping". 

The amino acid- numberAext at th^ efK i of each line "wrapped " down to the next line. 
This may occur if your fill^^^ retHeV^id^ processor after creeling K. 

Please adjust your right.majgln tp^^ Vrapping". 

The rules require that a line not exceed 72 characters in length. This includes spaces. J/\j^ J Q ^qqj 

numt>ering under each 5th amino acid Is misaligned. This may be caused by tl^e use of tabs 
reen the numbering. It is recommended to delete any tabs and use spacingTet^enifhe numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n*s or Xaa's which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may l>e missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequencers) ■ . Normally, Patentin would automatically generate this section from the_ 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence; 

(2) INFORMATION FOR SEO ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following formal for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or J(aa*s have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa. and which residue n or Xaa represents. 



Sequence(s) . 



are missing this mandatpry field or its resporise. 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGArJISr/f is "Artificial" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register." 6/01/98. Vol. 63', No. 104, pp. 29631-32) (Sec. i.823of new Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0, This causes a corrupted 
~fire*re'?ailing in missing mandat<5fy numeric identifiers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager" or any other means to copy file to noppy disk. 



A KS- Biotechnology Systems Branch- 5/15/99 
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RAW SEQUENCE LISTING 

PATENT APPLICATION : US/09/4 3 5,274 



DATE: 12/28/2 00 0 
TH-UC: 08:22:34 



Input Set : 
Output Set: 



A:\00100301 .app 

W:\CRF3\12 2 82 0 00\I4 3 52 74 .raw 



'i <:110> APPLf.CAWT: Ci.tovsky, Vir.a.l^ 
4 Rhep, - Yoon 

6 <120> TITLE OF IMVENTION: Genetic Assay i:'or Protein Nuclear Transport 
8 <130> FILK REFERENCE: 001.00301 

10 <140> CURRENT APPLTCATiOW NUMBER: US 09/43!^>,274 

1 1 < 1 4 1 > C URRE N T F i L X NG D ATE : 190 9-11-05 

13 <150> PRIOR APPLICATION NUMBER: US 60/107,^17 

14 <151> PRIOR FILING DATE: 199^-11-06 
1 6 < 1 0 > N U MB E R 0 F S EQ T D N (3S ; 13 

.18 <170> SOFTWARE: Pdtentln Ver . 2.1 
2 0 <:2i0> SEQ ID NO: 1 

21 <211> LENGTH: 61.1 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Unknov^n Organism 
2 5 <2 20> FFJATURE: 

20 <223> OTHER INFOPJ^IATION : Description of rjfjknown Organ ism : bacterial 
28 <220> FEATURE; 

2 9 <2 2 3> OTHER INFORMATION: modiflocl bacterial lexA 

31 <4 00> SEQUENCE: 1 

32 attjaaagcgt taacggccag gcaacaagag gtgtttgatc tcatccgtiga tcacatcagc 60 

3 3 cagacaggta tgccgccgac gcqtgcqgaa atcgcgcagc gtttggggtb cgttccccaa 120 
34 acgcggctqd agaacatctg aaggcgctgg cacgcaaagg cgttattgaa attgtttccg lilO 
3 5 gc^icatcacg cgggattcgt ctgttgcagg aagaggaaga agggttgccg cLggtaggtc 24 0 
36 gtt] Lggctqc cgg tgaacca cttct:ggcgc aacagca I'.a t: tgaaggtcat. tatcaggtcg 300 
3 7 at:.ccttcctt attcaagccg aatgctgatt tcctgctgcg cgtcagcggg atgtcgatga 360 
38 aagatatcgg cattatggat ggtgacttgc tggcagtgca taaaactcag gatgtacgta 420 

3 9 acggtcaggt cgttgtcgca cgtattgatg acgaagttac cgttaagggc cLggaaaaac 4tJ0 
40 acjtjgcaataa agtcgaactg ttgccagaaa atiagcgagtt taaaccaatt: gl.cgttgacc 54 0 
4:t ttcgtcagca gagcttcacc attgaagggc tggcggttgg ggttattcgc aacggcgact 600 

4 2 ggctggaatt c 611 
4 5 <210> SEQ ID NO: 2 

A6 <211> LENGTH: 204 

4 7 <2:t2> TYPE: PRT 

48 <213> ORGANISM: Unknown Organism 

50 <220> FEATURE; 

51 <223> OTHER INFORMATION: Descripti.on ol Unknown Organ ism : bacterial 
53 <220> FEATURE: 

5^1 <223> OTHER INFORMATION: modi fied bacterial lexA 

5 6 <'10 0> SEQUENCE: 2 

5 7 MgL Lys Ala Leu Thr Ala Arg Gin Gin Glu Val Plie Asp Leu Tie Arg 
58 1 5 10 15 

6 0 Asp His He Ser Gin Thr Gly Met Pro Pro Thr Arg Ala Glu He Ala 
61 20 25 30 

6 3 Gin Arg Leu Gly Phe Arg Ser Pro Asn Ala Ala Glu Glu His Ueu Lys 
64 35 40 45 

66 Ala Leu Ala Arg Lys Gly Val He Glu He Val Ser Gly Ala Ser Arg 

67 50 5 5 60 
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RAW SEQUENCE LISTING 

PAT E NT APP L j. C A'J' loV : US/ 0 9 /4 3 5 , 2 7 4 



DATK: .12/28/2000 
TIMI^: 08::^2:34 



input: Set r A:\0010030i.app 









Output 


Set: 


; N:\CRr3\12282000\I435274.raw 






69 


ciy ile 


Arq 




Leu 


Gin 


Glu 


G 1 u 


Glu 


Glu 


(lly 


Leu 


Pro 


Leu 


Val 


Gly 


70 










70 










7 5 










BO 


72 


Arq Val 


Ala 


A 1^1 




Glu 


Pro 


Leu 


Leu 


Al a 


Gin 


Gin 


His 


lie 


Glu 


Gly 


7 3 








8 5 










90 










95 




75 


His Tyr 


Gin 


Val 


Asp 


Pro 


Ser 


Leu 


Phe 


Lys 


Pro 


Asn 


Ala 


Asp 


Phe 


Leu 


76 






10 0 










105 










110 






78 


Leu Ar<) 


Vd], 


Ser 


G .1 V 


Met 


Ser 


Me L 


Lvs 


Asp 


r le 


G 1 y 


1 le 


Met 


Asp- 


Gly 


79 




115 










120 










"1 25 








8'! 


Asp Leu 


Leu 


Al^J 


Val 


H is 


Lys 


llvr 


Gin 


Asp 


val 


A.E.'q 


Asn 


Gly 


Gin 


Val 


82 


130 










135 










140 










34 


Val Val 


Ala 


Arg 


Ile 


Asp 


Asp 


Glu 


Val 


Thr 


Val 


Lys 


Gly 


Leu 


Glu 


Lys 


85 


145 








150 










155 










160 


87 


G.].n Gly 


Asn 


hvs 


Val 


Glu 


Leu 


Leu 


Pro 


Glu 


Asn 


Ser 


Glu 


p]ie 


Lys 


Pro 


H8 








16 5 










17 0 










175 




9 0 


lie Val 


Vdl 


Asp 


Leu 


A.r9 


Gin 


Gin 


Ser 


Phe 


Tlir 


lie 


Glu 


Gly 


Leu 


Ala 


91 






18 0 










185 










190 






93 


Vdl Gly 


Val 


Ile 


Arg 


As 11 


Gly 


Asp 


Trp 


Leu 


Glu 


Phe 










y4 




.195 










200 


















97 


<210> SI 


ID NO 


























98 


<211> LENGTH: 7 


























9 9 


<212> TYPlil: 


PRT 


























TOO <213> ORGANISM 


: Simian 


virus 40 
















:10'. 


! <220> 1 


PEATURE : 



























103 <223> OTHER ItslFORMATION : larye T antigen MLS 

105 <4 00> SraifilNCE: 3 

106 Pro Lys Lys Lys Arg Lys val 

107 1 " ^5 

110 <210> SSQ TD tiO: 4 

111 <211> LENGTH: 15 

112 <212> TYPE: PRT 
1,13 <213> ORGANISM: Xenopus Sp . 

115 <2 20> FEATURIi: 

116 <223> OTHER INFORMATION; nucleoplasmln NLS 
IIB <.400> ^vrvrfr^^- d 

W--> 119 Lys Ar^'pCaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa^Lys Lys Lys Leu 

^ S — "TH 15 



120 1 

12 3 <210> SEQ ID NO; 5 

124 <211> LENGTH: 9 

.125 <212> TYPE: PRT 

126 <213> ORGANISM: Human immunorJeticiency virus type 1 

128 <22 0> FEATURE: 

129 <22 3> OTHER INFORMATION: -Rev protein NES 

131 <400> SEOOENCE: 5 

132 Leu Pro Pro Leu Glu Arg Leu Thr Leu 

133 1 5 

136 <210> Sm 10 NO: 6 

137 <211> LENGTH: 9 

138 <212> TYPE: PRT 

139 <213> ORGANISM: Artifici.al Sequence 



MM 
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RAW SEQUENCE LISTING DATE: .12/28/2000 

PAT1^;NT application : US/0 9/4 3 5, 274 TIME : 08:22:34 



Input Set A:\00100301 . app 

Output Set: H:\CKF3\122 82000\l4 35274.raw 



Ml 


<220> 


FEATURE: 


14 2 


<2 23> 


OTiJ ER I H L^ORi'IA TI ON : Dc s c r i p t i O n 


14 3 




sig ua 1 


14 5 


<220> 


PEATURK : 


14 6 


<223> 


OTIUOR 1NF01U4AT10N : inutnted NHS 


148 


<4 00> 


ShZOUhJNCL : 6 


14 9 


Leu Pro Pro Asp Leu Arq Leu Tlu* Leu 


150 


1 


5 


15 3 


<210> 


SKQ ID NO: 7 


15 4 


<21I> 


LENGTH ; 4 


15 5 


<212> 


Ti'PE : PRT 


15 6 


<2I3> 


ORGANISM; A.r t i t i c i. a l S e q u e n c e 


158 


<220> 


FEATURE : 


159 


<223> 


OTHER INFORMATION: Description 


16 0 




signal 


162 


^220> 


FEATURE ; 


1 6 3 


<2 2 3> 


OTHER INFORMATION: residual NE; 


165 


<4 00> 


SEQUENCE: 7 


16 6 


Leu Pi 


r:o Pro Leu 


16 7 


1 




170 


<210> 


SEO ID NO: 8 


.17:1 


<211> 


LENGTH: 26 


17 2 


<212> 


TYPE; DNA 


17 3 


<213> 


ORGANISM: Artificial Sequence 


175 


<220> 


FEATURE: 


17 6 


<223> 


OTHER INFORMATION; Description 


1 7 8 


<220> 


FEATURE : 


179 


<223> 


OTHER INFORMATION; GAL4 primer 


18.1 


<4 00> 


SEQUENCE: 8 


182 


gggaattcaa ttttaatcaa agtggg 


18 5 


<210> 


SEQ ID NO: 9 


18 6 


<211> 


LENGTH: 27 


18 7 


<212> 


T^PE : DNA 


188 


<213> 


ORGANISM: Artificial Sequence 


19 0 


<2 20> 


FEATURE : 


191 


<223> 


OTHER INFORMATION: Description 


193 


<220> 


FEATURE : 


194 


<223> 


OTHER INFORMAI'ION: GAL4 primer 


196 


<4 00> 


SEQUENCE: 9 


197 


gacggatccc eg gg tat teg atctctt 


200 


<210> 


SEO ID NO; 10 


201 


<211> 


LENGTH : 2 9 


202 


<212> 


'TYPE : DNA 


20 3 


<213> 


ORGANISM: Artificial Sequence 


205 


<220> 


FEATURE : 


206 


<22 3> 


OTHER INFORMATION: Description 


208 


<220> 


FEATURE: 


209 


<223> 


OTHER INFORMATION: GAL4 primer 


211 


<400> 


SEQUENCE; 10 



of A.rtil:lcial Sequence ; nuclear export 
of pNEAMlO 

o f A r t i f .i. c -i . a 1 S e q u e n c: e : n u c 1 e a r expo r t 
of pNEARev(delta ) 3 

of Artificial Sequence : primer" sequence 

26 

of Art.i.ficial Sequence : primer sequence 

27 

of Artificial Sequence ; pr.imer sequence 
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RAW SEQUENCE LIS'll-TNG DA'i'J? : :L2/28/200G 

P AT K N 'r KP P L I C AT.V 6 K : " US / 0 9 /4 3 5 , 2 7 4 ' T I ME : 0 B : 2 2 : 3 4 



Input: Set 
Output Set 



'A:\00i00301.api3 
N:\CKF3\I2282000\I4 3 527 4 . raw 



212 ciygaaLtctja Uaaagcggaa ttaattccc 29 
-215 <210> -SEQ 10 NO: 11 

2 16 <2;n> LENGTH: 27 
217 <2.12> TYPI-]; DNA 

213 <213> ORGANISM: Artificial Sequence ■ 
22 0 <220> FEA'IUJRE: - 

2 21 <2 2 3> ^ OTJ:f GK 1 NlTOimA'r lOjtJ : Desc ript i on o'f Ar t i f ic ia 1 Seque ace : pr inie r sequence 
22.3 <220> FfiJATaRK: " . „ 

224 <223> OTHEH INFOfy^ATXON : GAL') primer 

226 <4 00> SEQUKHCE: IX - ■ ■ - 

227 cjactjciatecc cyggtattcg atcr.ctr. ..27 

230 <210> 3EQ ID NO: iS 

231 <2:il> LKIIGTH : 25 

232 <:212> TYPJ3: DMA 

235 <213> ORGANISM: ArtiCiciai Sequence 

235 <220> PBATURK: ' ' 

236 <223> OTHBR INrORMATlOW: Description of Artificial Sequence : primer sequence 
2 38 <220> PICATURE; 

239 < 2 2 3 > OTHER IN FO RM AT 1 0 N : in u t a n t le X A primer 
241 <10,0> SECHJENCE: 12 

24 2 cc<jttaa(j<ig cctqqaaaaa cacjgcj 2 5 

24 5 <210> SEQ TD NO: 13 
24G <2XI> LENGTH: 26 

24 7 <212> TYPE: DNA ; . 

2 4 B < 2 1 3 > ORGAN I S M ; Ar t .i f i c i a 1 S e q u e n c e 
,230 <220> FEATURE: 

'251 <223> OTHER INFORMATION: Descrlptidh* of Artificial Sequence : primer sequence 

253 <2 20> FEATURE: 

254 <223> OTHER INFORMATION: selection le:':A primer 

256 <4O0> SEQUENCE: 13 

257 gtgacticjgtcj aygcctcaac ca agtc 26 
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12/2S/00 



Page 5 of 6 



VERIFICATION SUMMi^RY DATIi! : :t 2/2 8/2000 

^ - PATl^N^P- ;VP-PLXCA^^:ON : US/09/4 3-5,2-7-4 ^" " TTME 0 8 : 2-2T3"5 """ 

Input Set : *'a:\0 0100301. app 

Oil t pii t So t : N : \C RF 3 \ 1 2 2 8 2000\I435274. raw 

L;.119 M:25B W: MaiuTatory Pea Lure miss in <221> not found for SEQ I0#:4 
'L:119 M:25B W: Mandatory Feature missing, <222> not: found for SEQ IDfM.4 
L:119 M:340 W: (46) "n" or "Xaa'' used: Feature required, for SKQ IDjf : 4 
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